Task Space Tile Coding: In-Task and Cross-Task Generalization in Reinforcement Learning

نویسنده

  • Lutz Frommberger
چکیده

Exploiting the structure of a domain is an important prerequisite for being able to efficiently use reinforcement learning in larger state spaces. In this paper, we show how to benefit from the explicit representation of structural features in so-called structure space aspectualizable state spaces. We introduce task space tile coding as a mechanism to achieve generalization over states with identical structural properties. This leads to a significant improvement of learning performance. Policies learned with task space tile coding can also be applied to unknown environments sharing the same structure space and thus enable for a faster learning in new tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cycle Time Optimization of Processes Using an Entropy-Based Learning for Task Allocation

Cycle time optimization could be one of the great challenges in business process management. Although there is much research on this subject, task similarities have been paid little attention. In this paper, a new approach is proposed to optimize cycle time by minimizing entropy of work lists in resource allocation while keeping workloads balanced. The idea of the entropy of work lists comes fr...

متن کامل

Tile Coding Based on Hyperplane Tiles

In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-valu...

متن کامل

Progress in learning 3 vs. 2 keepaway

Reinforcement learning has been successfully applied to several subtasks in the RoboCup simulated soccer domain. Keepaway is one such task. One notable success in the keepaway domain has been the application of SMDP Sarsa(λ) with tile-coding function approximation [9]. However, this success was achieved with the help of some significant task simplifications, including the delivery of complete, ...

متن کامل

Assessment of manual material handling in a tile and ceramic factory using the National Institute for Occupational Safety and Health‎ equation in 2016

Background: Manual handling, lifting, or carrying of material is responsible for non-fatal injuries among employees in industries. It is the second most prevalent reported risk factor in workplaces that can lead to potential manual handling accidents and longer-term musculoskeletal disorders (MSDs). The aim of this study was the evaluation of manual material handling using the American National...

متن کامل

Hierarchical Reinforcement Learning in Computer Games

Hierarchical reinforcement learning is an increasingly popular research field. In hierarchical reinforcement learning the complete learning task is decomposed into smaller subtasks that are combined in a hierarchical network. The subtasks can then be learned independently. A hierarchical decomposition can potentially facilitate state abstractions (i.e., bring forth a reduction in state space co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011